Model Selection

Non-commercial Research Use

# Non-commercial Research Use

DAM-3B is a 3-billion-parameter vision-language model capable of generating fine-grained local descriptions for user-specified image regions.

Safetensors English

Japanese Instructblip Alpha

A visual-language instruction-following model capable of generating Japanese descriptions for input images with optional text prompts

Transformers Japanese

Wav2vec2 Large Robust 12 Ft Emotion Msp Dim

This model is fine-tuned from Wav2Vec2-Large-Robust for speech emotion recognition, predicting values in three dimensions: arousal, dominance, and valence.

Audio Classification

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase